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Abstract — The conventional assumption made in the design 
of communication systems is that the energy used to transfer 
information between a sender and a recipient cannot be reused 
for future communication tasks. A notable exception to this 
norm is given by passive RFID systems, in which a reader can 
transfer both information and energy via the transmitted radio 
signal. Conceivably, any system that exchanges information via 
the transfer of given physical resources (radio waves, particles, 
qubits) can potentially reuse, at least part, of the received 
resources for communication later on. In this paper, a two-way 
communication system is considered that operates with a given 
initial number of physical resources, referred to as energy units. 
The energy units are not replenished from outside the system, 
and are assumed, for simplicity, to be constant over time. A node 
can either send an "on" symbol (or "1"), which costs one unit of 
energy, or an "off" signal (or "0"), which does not require any 
energy expenditure. Upon reception of a "1" signal, the recipient 
node "harvests" the energy contained in the signal and stores it 
for future communication tasks. Inner and outer bounds on the 
achievable rates are derived, and shown via numerical results to 
coincide if the number of energy units is large enough. 

I. Introduction 

The conventional assumption made in the design of commu- 
nication systems is that the energy used to transfer information 
between a sender and a recipient cannot be reused for future 
communication tasks. A notable exception to this norm is 
given by passive RFID systems, in which a reader can transfer 
both information and energy via the transmitted radio signal. 
Upon reception of the radio signal from the reader, a passive 
RFID tag modulates information by backscattering the radio 
energy received from the reader (see, e.g., []2]). Another, less 
conventional, example is that of a biological system in which 
information is exchanged via the transmission of particles (see, 
e.g., |4|), which can be later reused for successive commu- 
nication tasks. More in general, any system that exchanges 
information via the transfer of given physical resources (radio 
waves, particles, qubits) can conceivably reuse, at least part, 
of the received resources for later communication tasks. 

This paper is motivated by the examples above to consider a 
two-way communication system [3 | that operates with a given 
initial number of physical resources, which we will refer to as 
energy units (see Fig. [TJ. The energy units are not replenished 
from outside the system and can only decrease with time. To 
simplify the analysis, assume that the two parties involved 
have a common clock and that, at each time, a node can 
either send an "on" symbol (or "1"), which costs one unit 
of energy, or an "off" signal (or "0"), which does not require 
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Figure 1. Two-way noiseless binary communication with energy exchange. 
The total number of energy units is fixed (to five in the figure) and transmission 
of a "1" symbol transfers energy from the sender node to the recipient. 



any energy expenditure. Upon reception of a "1" signal, the 
recipient node "harvests" the energy contained in the signal 
and stores it for future communication tasks. In general, such 
harvesting process can incur an energy loss. 

Furthermore, let us assume that the binary channel in either 
direction is noiseless. Clearly, if there were no limitation on 
the number of energy units, the nodes could communicate 1 
bit per channel use in either direction given that the channels 
are ideal. However, consider now the case with a single energy 
unit available in the system. Moreover, assume that there are 
no energy losses so that when a "1" is received, one energy 
unit is retrieved at the recipient. Then, at any time instant, 
the energy unit is available at either Node 1 or at Node 2. It 
follows that only the node that currently possesses the energy 
unit can transmit a "1", whereas the other node is forced to 
transmit a "0". Therefore, the design of the communication 
strategy at the nodes should aim not only at transferring 
the most information to the counterpart, but also to facilitate 
energy transfer. Due to the constraints on the available physical 
resources for transmission, it is expected that the maximum 
sum-rate of two, achievable with no energy limitations, cannot 
be attained with a limited number of energy units. It is this 
trade-off that we are interested in studying in this paper. 

A. Contributions and Related Work 

In this paper, we will focus on the simple two-way binary 
noiseless model illustrated above and assume that the initial 
number of energy units is given, and can be transferred as 
discussed above upon transmission of a "1" symbol. We will 
assume, for simplicity, that there are no energy losses in the 
system and investigate the set of rate pairs achievable for any 
given number of energy units. Specifically, inner and outer 
bounds on the achievable rates are derived, and shown via 
numerical results to coincide if the number of energy units is 
large enough. 



A few additional remarks are in order. The model that we 
consider here is different from a conventional setting in which 
a conventional total average cost constraint is imposed on the 
two nodes (see e.g., Hi). Indeed, a cost constraint does not 
entail any memory in the system, whereas, in the set-up at 
hand of information exchange, the current transmissions affect 
the available energy in the next time instants. The model is 
more related to recent works that analyze communication in 
the presence of energy harvesting (see [5 | and the references 
therein). However, in those works, one assumes that the 
energy is harvested from the environment in a way that is 
not affected by the communication process. Instead, here, the 
energy available at a node depends on the previous actions of 
all other nodes involved in the communication process. 

Notation: [m, n] — {m, m + 1, n} for integers m < n; N 
is the set of integer numbers; Notations H(X) and H{p(x)) 
are both used to denote the entropy of a random vector with 
distribution p(x); If the distribution is Bern(p) we will also 
write H(p) for the entropy. Capital letters denote random 
variables and the corresponding lowercase quantities denote 
specific values of the random variables. 

II. System Model 

We consider the binary and noiseless two-way system 
illustrated in Fig. 1, in which the total number of energy 
units in the system is limited to a finite integer number 
U > 1. At any given time instant i, with i G [l,n], the 
state of the system {U\,u S N 2 is given by the current 
energy allocation between the two nodes. Specifically, a state 
(Ui,i, C^2,i) indicates that at the z-th channel use there are Ujj 
energy units at Node j, with j = 1,2. Since we assume that 
Ui.i + U2A = U for each channel use i G [1, n] (i.e., no energy 
losses occur), then we will refer to U\^ as the state of the 
system, which always imply the equality Ui.i = U — U\,i- 

At any channel use i G [1, n], each Node j can transmit 
either symbol Xjj — or symbol Xjj — 1, and transmission 
of a "1" costs one energy unit, while symbol "0" does 
not require any energy expenditure. Therefore, the available 
transmission alphabet for Node j, j = 1,2 during the i— th 
channel use is 

X u = {0, 1} if U jti = u > 1 
and X = {0} if U j4 = 0, (1) 

so that Xj t i G X u if Uj i = u energy units are available at 
Node j. The channel is noiseless so that the received signals 
at channel use i are given by 

Y lti = X 2li and Y 3 ,i =X lfi (2) 

for Node 1 and Node 2, respectively. 

Transmission of a "1" transfers one energy unit from the 
sender node to the recipient node. Therefore, the state of Node 
1 for i G [l,n] evolves as follows 

Uia — (Ui t i-i ~ Xi : i^i) + + X2,i_i, (3) 

where we set U\,i — Ui t % < U as some initial state and 
Ui.i = U — Ui t i. We observe that the current state U\ t i is 



a deterministic function of the number U of total energy units, 
of the initial state Ux i and of the previously transmitted 
signals X^ 1 and X%~ . We also note that both nodes are 
clearly aware of the state of the system at each time since 
Ui.i + Ui.i = U is satisfied for each channel use i. 

Node 1 has message Mi, uniformly distributed in the set 
[l,2 ni?1 ], to communicate to Node 2, and similarly for the 
message A/2 G [l,2 ,li?2 ] to be communicated between Node 
2 and Node 1. Parameters R\ and R2 are the transmission 
rates in bits per channel use (c.u.) for Node 1 and for 
Node 2, respectively. We use the following definitions for an 
(n, R\ , i?2 > U) code. Specifically, the code is defined by: the 
overall number of energy units U; two sequences of encoding 
functions, namely, for Node 1, we have functions fx j for 
i G [l,n], which map the message Mi and the past received 
symbols A^ -1 (along with the initial state) into the currently 
transmitted signal X\ t i G Xjj x i ; similarly, for Node 2, we have 
functions i-i.i for i G [l,n], which map the message M2 and 
the past received symbols X^ 1 (along with the initial state) 
into the currently transmitted signal X^.i G Xjj 2i ; and two 
decoding functions, namely, for Node 1, we have a function gi, 
which maps all received signals XV; and the local message M\ 
into an estimate M\ of message Mi\ and similarly, for Node 
2, we have a function g2, which maps all received signals X" 
and the local message M2 into an estimate Mi of message 
Mi. We require that the estimated messages be equal to the 
true messages given the noiseless nature of the channel for 
any initial state U\,\ = < U. 

We say that rates (i?i , R2) are achievable with U energy 
units if there exists an (n, R%, R%, u) code for all sufficiently 
large n. We are interested in studying the closure of the set of 
all the rate pairs (i?i,i?2) that are achievable with U energy 
units, which we refer to as capacity region C(u). 

III. Achievable Rates 

In this section, we consider various communication strate- 
gies. We start by the simplest, but intuitively important, case 
with U= 1, and we then generalize to U> 1. 

A. U= 1 Energy Unit 

We start with the special case of one energy unit (U = 1) 
and assume the initial state Ui t x — 1, so that the energy unit is 
initially available at Node 1. The other case, namely u% x = 0, 
can be treated in a symmetric way. In this setting, during each 
channel use, "information" can be transferred only from the 
node where the energy unit resides towards the other node, 
and not vice versa, since the other node is forced to transmits 
the "0" symbol. This suggests that, when U = 1, the channel is 
necessarily used in a time-sharing manner, and thus the sum- 
rate is at most one bit per channel use. The first question is 
whether the sum-rate of 1 bit/c.u. is achievable, and, if so, 
which strategy accomplishes this task. 

1) A Naive Strategy: We start with a rather naive encoding 
strategy that turns out to be insufficient to achieve the upper 
bound of 1 bit/c.u.. The nodes agree on a frame size F — 2 b > 
1 channel uses for some integer b and partition the n channel 



uses in n/F frames (assumed to be an integer for simplicity). 
The node that has the energy unit at the beginning of the frame 
communicates b = log 2 F bits to the other node by placing 
the energy unit in one specific channel use among the F = 2 b 
of the frame. This process also transfers the energy unit to 
the other node, and the procedure is repeated. The sum-rate 
achieved by this scheme is 

Ri + R2 = [bits/c.u.], (4) 

r 

which is rather inefficient: the maximum is achieved with F = 
2, leading to a sum-rate of R± + R2 = 1/2 bits/ c.u.. 

The previous strategy can be easily improved by noting 
that the frame can be interrupted after the channel use in 
which the energy unit is used, since the receiving node can 
still decode the transmitted b bits. This strategy corresponds 
to using a variable-length channel code. Specifically, we can 
assign, without loss of optimality within this class of strategies, 
the codeword "01" to information bit "0" and the codeword 
"1" to bit "1". The average number of channel uses per bit 
is thus 1/2 + 1/2-2 = 3/2 . Therefore, the overall number 
of channel uses necessary for the transmission of m bits is 
upper bounded by + me with arbitrarily small probability 
for large m by the weak law of large numbers (see, e.g., fl]). 
It follows that an achievable sum-rate is given by 

Rl+R2 = J/2 = l (5) 

which is still lower than the upper bound of 1 bit/c.u.. 

2) An Optimal Strategy: We now discuss a strategy that 
achieves the upper bound of 1 bit/c.u.. The procedure is based 
on time-sharing, as driven by the transfer of the energy unit 
from one to the other node. Specifically, each Node j has 
m bits of information bj t \, bj m . Since the initial state is 
u\ : i = 1, Node 1 is the first to transmit: it sends its information 
bits, starting with 61.1 up until the first bit that equals "1". 
Specifically, assume that we have 61,1 = 61.2 = • ■ • ^i,u-i = 
and bn ± = 1. Thus, in the i x — th channel use the energy unit 
is transferred to Node 2. From the [it + 1)— th channel use, 
Node 2 then starts sending its first bit b 2 ,i and the following 
bits until the first bit equal to "1". The process is then repeated. 
It is easy to see that the total time required to finalize this two- 
way communication is constant and equal to 2m and thus the 
achieved sum-rate is equal to R\ + R2 = 1 bit/c.u.. 

B. U > 1 Energy Units 

In the sum-capacity strategy discussed above with U = 1 
energy unit, both nodes transmit equiprobable symbols "0" 
and "1". When there are U > 1 energy units in the system, 
maximizing the sum-capacity generally requires a different 
approach. Consider the scenario with U = 2 energy units: 
now it can happen that both energy units are available at one 
node, say Node 1. While Node 1 would prefer to transmit 
equiprobable symbols "0" and "1" in order to maximize the 
information flow to the recipient, one must now also consider 
the energy flow: privileging transmission of a "1" over that of 
a "0" makes it possible to transfer energy to Node 2, leading 



to a state in which both nodes have energy for the next channel 
use. This might be beneficial in terms of achievable sum-rate. 

Based on this insight, in the following, we propose a coding 
strategy that employs rate splitting and codebook multiplexing. 
The strategy is a natural extension of the baseline approach 
discussed above for the case U = 1. Each Node j constructs 
U codebooks, namely Cji u , with u 6 [l,u], where codebook 
Cji u is to be used when the Node j has u energy units. 
Each codebook Cj\ u is composed of codewords that all have 
a specific fraction pu u of "1" symbols. The main idea is that, 
when the number u of available energy units is large, one 
might prefer to use a codebook with a larger fraction pu u of 
"1" symbols in order to facilitate energy transfer. 

Proposition 1. The rate pair {R\,R2) satisfying 

U 

R\ < ^2^uH{pi\ u ) 

u 

and R 2 < ir u H(p 2 \ u ) (6) 

for some probabilities < Pi\ u -,p 2 \ u < 1> u = 1 • ■ ■ U, with 
Pi|o = P2|U = 0> i s included in the capacity region C(U), 
where the probabilities n u > 0, u = . . . U satisfy the fixed- 
point equations 

T-u — T u (<^>o,0|-u ~t~ T11— 100, 1|« "I" 7r «+l'/ > l,0|u (7) 

with 7r_i = 7ru +1 = 0, 2~Zu=i 7r « = 1> an d we have defined 

<?Vo|u = (l-Pl|u)(l-P2|U-J 
4>0,l\u = (1 -Pl|u)P2|U-u 
01,Q|u = Pl|u(l -P2|U-«) 

and t)>x,\\u = Pi|uP2|U-u- (8) 

This proposition is proved by resorting to random coding argu- 
ments, whereby codebook Cj\ u is generated with independent 
and identically distributed (i.i.d.) Bern(pj| u ). As introduced 
above, the idea is that, when the state is U\.i = u, Node 
j transmits a symbol from the codebook associated with that 
state, namely codebook C^ u for Node 1 and codebook C 2 \xj- U 
for Node 2 (which has U — u energy units). Both nodes know 
the current state U\ : i and thus can demultiplex the codebooks 
at the receiver side. According to the random coding argument, 
the state f/i,, evolves according to a Markov chain: the system 
stays in the same state u with probability 0o,o|« +<^i i|« (both 
nodes transmit "0" or "1"), changes to the state u + 1 with 
probability <fii.o\ u (Node 1 transmits a "1" and Node 2 a "0") 
or changes to the state u — 1 with probability </>o.i|u (Node 
1 transmits a "0" and Node 2 a "1"). The definition of the 
conditional probabilities ([8]l reflects the fact that the codebooks 
are generated independently by the two nodes. A full proof is 
given in Appendix |A] 

IV. Outer Bounds 

In this section, we derive an outer bound to the capac- 
ity region C(u). To set up the notation, for a given joint 



distribution 4> XltX2 \ u > 0, conditional on some value u, 
with £1,2:2 G {6,1} and Exi,x 2 e{o,i} ^ u =c 2 \u = 1, we 
define the marginal distributions (f> Xl \ u = 2~2 X2 =q ( t ) x l ,x 2 \u an d 
4>x 2 \u = Exi=o < ^xi,s 2 |«' ^ e conditional distributions 

0xi|x 2 ,u = ^Xi,x 2 |u/<^x 2 |u and ^x 2 |xi,u = ^xi.xsslu/^xxlw f° r 
Zl,X 2 G {0, 1}. 

Proposition 2. If the rate pair (R\,R 2 ) is included in 
the capacity region C(u), then there exist probabilities 
ir u > with 2«=i 7r « = !> an< ^ 0xi^r 2 |u > witn 
Exi,x 2 e{o,i} ^i,x 2 |« = 1 /or a// u G {0, 1, u}, such that 
<t>l,x 2 \o = Q f or x z e {0, 1}, <p Xl ,i\u = for xi G {0,1}, 
condition is satisfied, and the following inequalities hold 

u 1 

Rl < ^2 Wu X] ^2|«# (01|x 2 ,u) ( 9 ) 

U=0 £2=0 

(7 1 

«2 < ^ T« X ai | U ^ falx!,*) (10) 

li — 3^1— 

(7 

and R x + R 2 <^2tt u H ((t) Xl X ^ u ) . (11) 

u=0 

The outer bound above can be interpreted as follows. Suppose 
that, when the state is U\ t i = u, the nodes were allowed 
to choose their transmitted symbols according to a joint 
distribution 4>x 1 ,x 2 \u = PrfX^j = xi, X 2i — x 2 }. Note that 
this is unlike the achievable strategy described in the previous 
section in which the codebook were generated independently. 
Intuitively, allowing for correlated codebooks, leads to a larger 
achievable rate region, as formalized by Proposition 2, whose 
proof can be found in Appendix B. 

V. Numerical Results 

Fig. |2] compares the achievable sum-rate obtained from 
Proposition 1 and the upper bound (fTTT l on the sum-rate 
obtained from Proposition 2 versus the total number of energy 
units U. As for the achievable sum-rate, we consider both a 
conventional codebook design in which the same probability 
Pj\u — 0.5 is used irrespective of the state U\.i = u, and 
one in which the probabilities pj\ u are optimized. It can be 
seen that using conventional codebooks, which only aim at 
maximizing information flow on a single link, leads to sub- 
stantial performance loss. Instead, the proposed strategy with 
optimized probabilities Pji u , which account also for the need 
to manage the energy flow in the two-way communication 
system, performs close to the upper bound. The latter is indeed 
achieved when U is large enough. 

A remark on the optimal probabilities p*^ u is in order. Due 
to symmetry, it can be seen that we have p^ u = P 2 \tj_ u - 
Moreover, numerical results show that p*, increases mono- 
tonically as u goes from to U, such that p* y > 0.5. In 
particular, when the number of states U + 1 is odd, it holds 
that p* yy 2 = p* ,j^ 2 = 0.5. It is finally noted that the energy 
neutral transitions (both nodes emitting "0" or both emitting 
"1") occur with equal probability (i.e., (1 — p* M )(l u ) = 
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Figure 2. Achievable sum-rate obtained from Proposition 1 and upper bound 
y It versus the total number of energy units U. 

VI. Conclusion 

In resource-constrained systems in which the resources 
(e.g., energy) used for communication can be "recycled", 
new performance trade-offs arise due the need to balance 
the maximization of the information flow with the resulting 
resource exchange among the communicating nodes. This 
paper has illustrated this aspect by studying a simple two-way 
communication scenario with noiseless channels and limited 
resources. Various extensions of this work are of interest, 
including a generalization to a scenario with energy losses. 

Appendix A 
Proof of Proposition 1 

1) Code construction: We generate U codebooks for each 
Node j = 1,2, namely Cj\ u , with u G [1,U]. The code- 
book Cj\ u for u > has Kj tU codewords, each consisting 
of rij }U symbols Xj, u ,l G {0,1}, which are randomly and 
independently generated as Bern(pj|„) variables, with I = 
1,2, ...,n,j tU and n,j iU — nSj jU , for some < 5j. u < 1. We 
denote the codewords as x" 3 u M (m J)U ) with rrij tU G [1, -ft^J. 
Note that the parameter 6j. u does not depend on n, and hence, 
if n —> 00, then we have rij tU — > 00 for all j, u. We set 
2"flj _ rj u=1 Kj tU , while the relations among the remaining 
parameters (Kj tU ,Sj tU ,Pji u ) will be specified below. 

2) Encoding: Each node performs rate splitting. Namely, 
given a message Mj € [1,2"^], Node j finds a U— tuple 
[rrij t \, rrij rj] with m^ u G [l,-Kj )U ] that uniquely repre- 
sents Mj. This is always possible since we have 2 nRj = 
Yl u —iKj tU , Then, the selected codewords x™ J u "(mj tU ) for 
u G [1,U] are transmitted via multiplexing based on the 
current available energy. Specifically, each Node j initializes 
U pointers = l^ 2 = ■ ■ ■ = = 1 that keep track of the 
number of symbols already sent from codewords x™^ 1 (m^i), 

Ti TT 

x™ 2 2 (rrij t2 ), x^jj (rriju), respectively. At channel use i, 
if the state is Un = u, then the nodes operate as follows. 



• Node 1: If u = 0, then x\ ;L — 0. Else, if l\, u < ni iU , 
Node 1 transmits x\.i = Xi, u ,h u (mi lU ) an d increments 
the pointer h iU by 1. Finally, if l\ u — ni„ + l the pointer 
Vx iU is not incremented, and the transmitter uses random 
padding, i.e., it sends xx i = 1 with probability pi tU and 
x\.i = otherwise. 

• Node 2: If it = U (i.e., no energy is available at Node 2), 
then X2,i = 0. Else, if l 2 .jj- u ^ n 2 U-u' Node 2 transmits 
Z2,» = i2,U-u,2,z 2U u ( m 2,U-u) and increments the 
pointer 1 2> jj_ u by l'. Finally, if l 2lJ _ u = n 2 .U-« + L 
the pointer Z 2 u_ u is not incremented, and Node 2 sends 
%2,i = 1 with probability p 2 xj- u an d ^2.4 = otherwise. 

The random padding method used above is done for technical 
reasons that will be clarified below. 

3) Decoding: We first describe the decoding strategy for 
Node 2. By construction, the nodes are aware of the state 
sequence U", and thus can determine the ordered set 

M u = {i\Ui,i = u}, (12) 

of channel use indices in which the state is it with u S [0,U]. 
For all u G [1,U], if \Af u \ > m, u , then Node 2 takes the 



first n% u indices i u i < i U 2 < 



from the set 



M u and obtains the list of messages m\ u G that 
satisfy Xx u k( m l,u) — Xx,i a k for all k G [l,n ltl ]. Note that 
the list cannot be empty due to the fact that the channel is 
noiseless. However, it contains more than one message, or if 
\Nu\ < ni.u, then Node 2 puts out the estimate rhi :U = 1. 
Instead, if the list contains only one message mi, u , then Node 
2 sets mi jtl = mi „. Finally, the message estimate is obtained 
as rh\ = [rhi t i, m^u]. 

Node 1 operates in the same way, with the only caveat that 
the itth codebook C 2 \ u of Node 2 is observed at channel uses 
in the set Nu-u for u G [1, U]. 

4) Analysis: We evaluate the probability of error on average 
over the messages and the generation of the codebooks, 
following the random coding principle. From the definition of 
the decoders given above, the event that any of the decoders is 
in error is included in the set £ = \J j=1 2 Uu=i(^i,« u 
where: (i) fj 1 ^ is the event that \N U \ < ni lU for j ' = 1 and 
that |A/u_J < n 2 . u for j = 2, that is, that the number of 
channel uses in which the system resides in the state in which 



the codeword x 



( m j,u) from the codebook Cj u is sent is 

(2) 



not sufficient to transmit the codeword in full; (if) £\ u is 
the event that two different messages w' u ,m'' u G [l,ifj )U ] 
are represented by the same codewords, i.e., (m'j u ) = 

The probability of error can thus be upper bounded as 

2 U 

p r[£] < E E ( Pr [4«] + Pr t4«]) • ^ 

3=1 u=l 

In the following, we evaluate upper bounds on this terms. 
It immediately follows from the packing lemma of 1 1 1 that 

Pr[£ - 2 J] — > as rij M — >• oo as long as 



log 2 Kj 



3? u 



< H( Pjlu )-S{e) 



(14) 



with (5(e) — > as e — > 0. For analysis of the probabilities 
Pr^jy], we observe that, under the probability measure 
induced by the described random codes, the evolution of the 
state Uij across the channel uses i G [l,n] is a Markov 
chain with U + 1 states. Specifically, the chain is a birth-death 
process, since, if the state is Uij = u in channel use i, the next 
state C/i,i+i can only be either it— 1 or u+1. More precisely, let 
Qu\w — Pr(t r i,i+i = u\U\.i — w) be the transition probability. 
Note that, due to the use of random padding, the transition 
probability q u \ w remains constant during all n channel uses, 
so that the Markov chain is time-invariant. 

We now elaborate on the Markov chain U\^. To this end, 
we first define as <f> XltX2 \ u , where x\,x 2 G {0, 1} be the joint 
probability that Node 1 transmits X\,% — x% and Node 2 
transmits X 2 j — x 2 during the i— th channel use in which 
the state is Un = it. Specifically, from the way in which the 
codebooks are generated, we have (). We can now write the 
non-zero values of the transition probability q u i w as follows: 



Qu,u-1 — 01,01m Qum+1 — <P0,l\u 

Qu,u = 1 — 1u,u-l — 1u,u+l (15) 

With a slight abuse of the notation and noting that 1O |o = 
^i,i|o = an d 0o,i|U = ^i,i|U = the expressions above also 
represent the transitions for the two extremal states it = and 
u = U, as they imply <j<o|-i = and gu|U+i = 0- 

If pifl = P2,o = and < pi, u ,P2,u < 1 for all it > 0, 
then it can be seen that the Markov chain is aperiodic and 
irreducible, and thus there exist a unique set of stationary 
probabilities 7To, tti, • • • , tt\j, which are given by solving the 
linear system, defined by taking U equations of type Q for 



0...U 



1. 



1 and adding the condition X)«=o "« 
We are now interested in the statistical properties of the set 
\Af u \ of channel uses in which the state satisfies U% = u. Using 
the ergodic theorem and the strong law of large numbers Il6] 
Theorem 1.10.2], it can be shown that linin^oo = 7r u 

with probability 1. Therefore, if we choose: 



(16) 



then Pr[f}%] = Pr[^^_ u ] can be made arbitrarily close to 
as n — > oo. This concludes the proof. 



Appendix B 
Proof of Proposition 2 



Consider any (n, R\, R2, U) code with zero probability of 
error, as per our definition of achievability in Sec. [I] We have 



the following inequalities: 

nRi = H(M X ) = H{Mx\M 2 , U hl = u hl ) 
( =' H{Mi,X%, U?\M a , Ui,i = u hl ) 
( ^H(X?,U?\M 2 , U 1A = m A ) (17) 

n 

= H{x x , h u^\x{-\u[-\m 2 , u hl = u u ) 
»=i 

n 

= J2 H ( u i,i\ x t\Ut 1 ,M 2 ,U hl =ui,i) (18) 
+ H(X 1}i \Xt 1 ,Ui,M 2 ,U 1A = u 1 ,i) (19) 

n 

( = ) ^if(X M |Xj" 1 ,?7 1 i ,M 2 , C/i,i = tii,i) (20) 

(c£) n 

< ^^(Xi^C/i^Xa.i) (21) 

^fTp^c/i^Q) (22) 
<fr(Xi|^,x 2 ), (23) 

where (a) follows since X", {/" are functions of Mi,M 2 and 
u lt i; (b) follows since fl"(Mi|Xf , U?,M 2 , U x ,i = = 
holds due to the constraint of zero probability of error; 
(c) follows since Ux,i is a function of X\ ,M 2 and iti i; 
(J) follows by conditioning reduces entropy; (e) follows by 
defining a variable Q uniformly distributed in the set [l,n] 
and independent of all other variables, along with X\ = X\q, 
X 2 = X 2 q and U x = U 1Q . 

Similar for nR 2 we obtain the bound nRi < 
H{X\\U\,X 2 ). Moreover, for the sum-rate, similar steps lead 
to 

n{R x + R 2 ) = H(M l7 M 2 ) = H(M lt M 2 \U hl = u M ) 
= H(MiM 2 ,X", X 2 , U?\U 1A = u lt i) 
= H(X?,X2,U?\U ltl =u 1A ) 

n 

= Y / H(U lti \Xi-\xt\ui-\M 2 ,U 1 , 1 =u 1A ) 

z=l 

+ H(X 1:i ,X 2ii \Xi-\X 2 [- 1 ,UlM 2 ,U hl = u ltl ) 

n 

>J2 H ( X l,i, X 2,i\Ul,i) 
i=l 

= H[X 1 ,X 2 \U 1 ). (24) 

Let us now define tt u = Pr[J7i = u] and (f> Xl ,x 2 \u — 
Pr[Xi = xi,X 2 = x 2 \Ui — u] for i,j G {0,1} and for 
all ui G {0, 1, u}. Probability conservation implies that the 
relationship (|7]i be satisfied. This concludes the proof. 
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